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It  has  been  a  major  objective  of  this  investigation  to  determine  means 
by  which  data  from  an  unknown  multivariate  probability  distribution  may  be 
most  expeditiously  summarized,  particularly  in  the  exploratory  phase  of  the 
data  analysis  process.  Unlike  the  members  of  exploratory  data  analysis  (EDA) 
school  who  attempt  to  avoid  models,  our  analysis  is  oriented  toward  the 
ultimate  goal  of  modelling  the  stochastic  mechanism  at  hand.  [1]  gives  a 
survey  of  some  of  the  problems  outstanding  in  density  estimation. 

In  [2]  we  give  our  algorithm  for  discrete  maximum  penalized  likelihood 
estimation  of  a  one  dimensional  density  nonparametrically.  This  algorithm 
is  now  one  of  the  most  used  for  nonparametric  probability  density  estimation — 
having  been  made  a  part  of  the  IMSL  software  library  (NDMPLE).  It  should  be 
mentioned  that  this  routine,  as  a  part  of  the  IMSL  package,  is  on  disk  at 
practically  all  DOD  computer  centers. 

In  [3],  three  basic  approaches  are  investigated  for  estimating  two 
dimensional  densities.  Computer  software  is  developed  for  implementing  each 
of  the  approaches.  We  note  that  the  two  dimensional  density  estimation  prob¬ 
lem  is  actually  a  three  dimensional  problem  from  a  representational  stand¬ 
point.  Accordingly,  for  higher  dimensions  simple  graphing  of  the  density 
is  not  possible. 

[4]  examines  the  nonparametric  estimation  of  three  dimensional  densities 
The  nonion  peel"  algorithm  is  developed  for  this  task.  By  this  procedure, 
we  first  find  the  modes  of  the  density  and  then  use  these  as  centers  for 
further  investigation. 

A  problem  posed  by  Dr.  Malcolm  Taylor  of  the  Aberdeen  Proving  Ground 
Involves  large  scale  simulations  in  which  a  multidimensional  data  set  is 
to  be  used  as  a  base  for  a  pseudorandom  number  generator.  An  algorithm 
[5]  was  developed  for  this  propose  which  avoids  the  necessity  of  esti¬ 
mating  the  underlying  density.  It  is  now  operational  both  at  Aberdeen  and 


at  Rice. 
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[6]  gives  a  parametric  analysis  of  breast  cancer  data  in  an  attempt  to  ' 

model  the  progress  of  the  disease  in  a  large  population  of  patients,  A 
tentative  conclusion  is  made  that  in  a  significant  proportion  of  cases  of 
the  disease  the  five  year  disease  free  equivalence  of  a  cure  does  not  appear 
valid.  In  [7],  we  discuss  a  model -theoretic  approach  for  the  optimal 

scheduling  of  chemotherapeutic  regimes.  j 

i 

( 

The  technique  of  maximum  penalized  likelihood  is  employed  in  [8]  for 
the  nonparametric  estimation  of  the  intensity  of  metastasis  formation.  Con¬ 
trary  to  classical  neoplastic  theory,  we  discovered that  the  appearance  of 
new  metastases  is  essentially  constant  in  time.  | 

Based  on  the  exploratory  data  analysis  in  [8],  we  developed  in  [9]  a  j 

j  i 

stochastic  model  for  the  progress  of  cancer  and  used  it  to  analyze  a  number 

of  data  sets  from  several  different  kinds  of  cancer.  Our  tentative  conclusion  j 

is  that  cancer  appears  to  proceed  as  a  systemic  disease  with  the  throwing  off 

of  metastases  relegated  to  a  secondary  function.  This  result  gives  rise  to 

new  speculation  as  to  the  causes  and  treatment  of  cancer. 
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